Abstract: Polar codes, introduced by Arikan, achieve the symmetric
capacity of any discrete memoryless channel with low encoding and
decoding complexity. Recently, non-binary polar codes have been
investigated. In this paper, we calculate the error probability of
non-binary polar codes constructed on the basis of Reed-Solomon
matrices by numerical simulations. It is confirmed that 4-ary polar
codes have significantly better performance than binary polar codes
on the binary-input AWGN channel. We also discuss an interpretation
of polar codes in terms of algebraic geometry codes, and further show
that polar codes using Hermitian codes have asymptotically good
performance.

I. Introduction 

Arikan [1] proposed polar codes as codes that achieve the
symmetric capacity of arbitrary binary-input discrete memoryless
channels (DMCs) under low-complexity encoding and decoding. The
asymptotic error probability of polar codes has been studied in
detail by Arikan and Telatar [2], who showed that the error
probability is $o(2^{-N^{\beta}})$ for any $\beta < 1/2$ and
$\omega(2^{-N^{\beta}})$ for any $\beta > 1/2$, where $N$ is the
blocklength. Although polar codes are constructed on the basis of a
Kronecker power of the matrix
$\begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix}$ in Arikan's original
proposal, one can extend polar codes using a different matrix.
Korada, Şaşoğlu, and Urbanke discussed such extensions to improve the
threshold of $\beta$, which equals 1/2 when polar codes are
constructed using the $2 \times 2$ matrix mentioned above, and showed
that the threshold can indeed be made larger by using a larger
matrix [3].

Another important direction of extending polar codes is to consider
a non-binary input alphabet. Şaşoğlu, Telatar, and Arikan considered
non-binary polar codes and showed that polar codes achieve the
symmetric capacity when the size of the input alphabet is a prime
[4]. They also showed that one can still obtain capacity-achieving
codes even when the size of the input alphabet is not a prime, by
decomposing the original channel into multiple channels, each of
which has an input alphabet of prime cardinality, and by using a
polar code for each of these channels [5]. This method of
decomposition is also known as multilevel coding [6].

In [7], the authors discussed the case in which the size of the
input alphabet is an integer power of a prime, and showed that polar
codes defined on the input alphabet can achieve the symmetric
capacity without the decomposition, and that use of a larger matrix
can improve the asymptotic error probability, similarly to the
binary-input case. Furthermore, it is shown that Reed-Solomon
matrices can be regarded as a natural generalization of the binary
$2 \times 2$ matrix, providing a family of polar codes with various
nice properties.

In this paper, we calculate error probability of non-binary 
polar codes using Reed-Solomon matrices on the q-ary erasure 
channels by numerical simulations. We further show that 
polar codes using Hermitian codes have asymptotically good 
performance. 

II. Non-Binary Polar Codes 

Assume that $q$ is an integer power of a prime. Let
$W : \mathbb{F}_q \to \mathcal{Y}$ be a $q$-ary DMC and let $G$ be an
$\ell \times \ell$ matrix over $\mathbb{F}_q$. An
$\ell^n \times \ell^n$ matrix $G_{\ell^n}$ is defined as
$G_{\ell^n} := G^{\otimes n}$, where $\otimes n$ denotes the $n$-th
Kronecker power. Let $u_0^{\ell^n-1}$ be a row vector
$(u_0, \dots, u_{\ell^n-1})$ and $u_i^j$ be its subvector
$(u_i, \dots, u_j)$. Let $u_{\mathcal{F}}$ be the subvector
$(u_{f_0}, \dots, u_{f_{m-1}})$ of $u_0^{\ell^n-1}$, where
$\mathcal{F} = \{f_0, \dots, f_{m-1}\} \subseteq \{0, \dots, \ell^n-1\}$.
Let $\mathcal{F}^c$ be the complement of $\mathcal{F}$. Let
$W^{\ell^n} : \mathbb{F}_q^{\ell^n} \to \mathcal{Y}^{\ell^n}$ denote
the DMC defined as

$$W^{\ell^n}(y_0^{\ell^n-1} \mid u_0^{\ell^n-1}) := \prod_{i=0}^{\ell^n-1} W(y_i \mid u_i).$$

The $\ell$-ary bit-reversal matrix $R_{\ell^n}$ of size $\ell^n$ is a
permutation matrix defined by
$u_0^{\ell^n-1} R_{\ell^n} = (u_{r_0}, \dots, u_{r_{\ell^n-1}})$,
where the $\ell$-ary expansion $a_1 \cdots a_n$ of $i$ and the
$\ell$-ary expansion $a_n \cdots a_1$ of $r_i$ are the reversals of
each other. The encoder of polar codes is defined as
$\phi(u_{\mathcal{F}^c}) := u_0^{\ell^n-1} R_{\ell^n} G_{\ell^n}$,
where the all-zero vector is assigned to $u_{\mathcal{F}}$. The
matrix $G$ is called a kernel of the polar code. The decoder of polar
codes is a successive cancellation (SC) decoder. For
$i \in \{0, \dots, \ell^n-1\}$, $u_0^{i-1} \in \mathbb{F}_q^{i}$ and
$y_0^{\ell^n-1} \in \mathcal{Y}^{\ell^n}$, let

where $U_0^{\ell^n-1}$ and $Y_0^{\ell^n-1}$ are random variables
which obey the distribution

An output $\hat{u}_i$ of the decoder is determined sequentially from
$\hat{u}_0$ to $\hat{u}_{\ell^n-1}$ as

$\hat{u}_i = 0$, if $i \in \mathcal{F}$

In order to obtain polar codes of small error probability,
$\mathcal{F}^c$ has to be chosen such that $P_{\ell^n}^{(i)}$ is
small if $i \in \mathcal{F}^c$. The error probabilities
$\{P_{\ell^n}^{(i)}\}$ can be calculated by using density
evolution [8].
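The encoder described above can be sketched in a few lines. The sketch below is illustrative only: it assumes that $q$ is a prime, so that arithmetic in $\mathbb{F}_q$ is integer arithmetic modulo $q$, and the function names (`kron_power`, `digit_reversal`, `polar_encode`) are hypothetical, not taken from any library or from the authors' implementation.

```python
def kron(A, B, q):
    """Kronecker product of two matrices (lists of rows), entries mod q."""
    return [[(a * b) % q for a in row_a for b in row_b]
            for row_a in A for row_b in B]


def kron_power(G, n, q):
    """n-th Kronecker power of G over the integers mod q (q assumed prime)."""
    out = [[1]]
    for _ in range(n):
        out = kron(out, G, q)
    return out


def digit_reversal(l, n):
    """r[i] is the index whose l-ary expansion is the reversal of that of i."""
    r = []
    for i in range(l ** n):
        rev, x = 0, i
        for _ in range(n):
            rev = rev * l + x % l   # peel digits off i, push them onto rev
            x //= l
        r.append(rev)
    return r


def polar_encode(u, frozen, G, n, q):
    """Compute u R G^{(x)n}; positions listed in `frozen` are set to zero."""
    l = len(G)
    u = [0 if i in frozen else s % q for i, s in enumerate(u)]
    r = digit_reversal(l, n)
    permuted = [u[r[i]] for i in range(l ** n)]   # (u R)_i = u_{r_i}
    Gn = kron_power(G, n, q)
    return [sum(permuted[i] * Gn[i][j] for i in range(l ** n)) % q
            for j in range(l ** n)]


# Binary 2x2 kernel, blocklength 4, no frozen positions.
G2 = [[1, 0], [1, 1]]
codeword = polar_encode([0, 1, 0, 0], frozen=set(), G=G2, n=2, q=2)
```

For a prime-power alphabet such as $\mathbb{F}_4$, the modular arithmetic would have to be replaced by proper finite-field arithmetic.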

III. Exponent of Polar Codes and Reed-Solomon Kernel

A. Exponent of polar codes 

Asymptotic performance of polar codes is determined by the kernel
$G$. Arikan and Telatar showed that when $q = 2$ and
$G = \begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix}$, the error
probability of polar codes is $o(2^{-2^{\beta n}})$ for any
$\beta < 1/2$ and $\omega(2^{-2^{\beta n}})$ for any $\beta > 1/2$
[2]. Korada, Şaşoğlu, and Urbanke [3] generalized the result to any
$G$ when $q = 2$, showing that there exists a function
$E(G) \in [0, 1)$ such that the error probability of polar codes is
$o(2^{-\ell^{\beta n}})$ for any $\beta < E(G)$ and
$\omega(2^{-\ell^{\beta n}})$ for any $\beta > E(G)$. The threshold
$E(G)$ of $\beta$ is called the exponent of a kernel $G$. They also
showed that $\max_{G \in \mathbb{F}_2^{\ell \times \ell}} E(G)$
converges to 1 as $\ell \to \infty$. They further proposed an
explicit construction method of $G$ using BCH codes, for which the
exponent $E(G)$ can be made arbitrarily close to 1 as $\ell$ becomes
large [9].

In [7], the authors showed that the result about the exponent can
further be generalized to $q$-ary polar codes. The exponent $E(G)$ of
a kernel $G$ can easily be calculated for $q$ being equal to an
integer power of a prime, as follows.

Theorem 1 ([3]):

$$E(G) = \frac{1}{\ell} \sum_{i=0}^{\ell-1} \log_\ell D_i$$

where the partial distance $D_i$ is defined as

$$D_i := \min_{v_{i+1}^{\ell-1}} d\big((0_0^{i-1}, 0, v_{i+1}^{\ell-1})G,\ (0_0^{i-1}, 1, 0_{i+1}^{\ell-1})G\big).$$

In the above definition of $D_i$, $d(x, y)$ denotes the Hamming
distance between $x$ and $y$.

Definition 2: $L(q, \ell) := \max_{G \in \mathbb{F}_q^{\ell \times \ell}} E(G)$.

As in the case $q = 2$ [3], the best exponent $L(q, \ell)$ can be
lower bounded by using the Gilbert-Varshamov-like bound.

Lemma 3:

$$L(q, \ell) \ge \frac{1}{\ell} \sum_{i=0}^{\ell-1} \log_\ell \tilde{D}_i$$

where

$$\tilde{D}_i := \max\Big\{ D \in \mathbb{N} \ \Big|\ \sum_{j=0}^{D-1} \binom{\ell}{j} (q-1)^j < q^{i+1} \Big\}.$$

Corollary 4:

$$\lim_{\ell \to \infty} L(q, \ell) = 1.$$

Proof: Let
$\omega(\alpha) := \lim_{\ell \to \infty} \tilde{D}_{\lceil \alpha \ell \rceil} / \ell$
for $\alpha \in [0, 1]$. Then, the equality

$$h(\omega(\alpha)) + \log_2(q-1)\, \omega(\alpha) = \alpha \log_2 q$$

holds for any $0 \le \omega(\alpha) \le 1/2$, where $h(\cdot)$ is the
binary entropy function. Hence, $\omega(\alpha) = 0$ if and only if
$\alpha = 0$. The best exponent $L(q, \ell)$ is bounded from below as

$$L(q, \ell) \ge \frac{1}{\ell} \sum_{i=0}^{\ell-1} \log_\ell \tilde{D}_i
\ge \frac{1}{\ell} \sum_{i=\lceil \alpha \ell \rceil}^{\ell-1} \log_\ell \tilde{D}_i
\ge (1-\alpha) \log_\ell \tilde{D}_{\lceil \alpha \ell \rceil}
= (1-\alpha)\big(1 + \log_\ell(\tilde{D}_{\lceil \alpha \ell \rceil} / \ell)\big)$$

where $\tilde{D}_{\lceil \alpha \ell \rceil} / \ell$ approaches a
nonzero limit $\omega(\alpha)$ as $\ell \to \infty$ for any fixed
$0 < \alpha < 1$. Hence,
$\liminf_{\ell \to \infty} L(q, \ell) \ge 1 - \alpha$ for any fixed
$0 < \alpha < 1$. ■

B. Reed-Solomon kernel
Generator matrices of Reed-Solomon codes are considered suitable as
kernels of polar codes since they have the following two properties:
(1) Low-rate Reed-Solomon codes are subcodes of Reed-Solomon codes
with higher rates. (2) The minimum distance of Reed-Solomon codes
meets the Singleton bound. From these properties, for any
$\ell \in \{2, \dots, q\}$, one can obtain the $q$-ary matrix
$G_{RS}(q, \ell)$ of size $\ell \times \ell$ whose submatrix
consisting of the $i$-th row to the $(\ell-1)$-th row is a generator
matrix of an $[\ell, \ell-i, i+1]_q$ Reed-Solomon code. We call the
$q$-ary matrix $G_{RS}(q, \ell)$ the Reed-Solomon matrix. From
Theorem 1, $E(G_{RS}(q, \ell)) = \log(\ell!)/(\ell \log \ell)$. The
second property mentioned above guarantees the optimality of the
Reed-Solomon matrix $G_{RS}(q, \ell)$ as a kernel of polar codes,
yielding $L(q, \ell) = \log(\ell!)/(\ell \log \ell)$ for all
$\ell \le q$. One obtains, for example, $L(4, 4) \approx 0.57312$,
while in the binary case $L(2, 31) \le 0.55$ and
$L(2, 16) \approx 0.51828$ [3]. This example demonstrates the
efficiency of using a non-binary kernel in constructing polar codes
even for a binary-input DMC.

The original binary $2 \times 2$ matrix can be identified as
$G_{RS}(2, 2)$. This implies that $G_{RS}(q, q)$ can be regarded as a
natural generalization of the binary $2 \times 2$ matrix. In
Subsection V-A, we see the relation between polar codes using
$G_{RS}(q, q)$ and $q$-ary Reed-Muller codes, which has been
mentioned by Arikan in the binary ($q = 2$) case [1].
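The exponent of a Reed-Solomon kernel can be checked numerically. The sketch below applies Theorem 1 to the partial distances $D_i = i + 1$, which follow from the $[\ell, \ell-i, i+1]_q$ structure stated above, and compares the result with the closed form $\log(\ell!)/(\ell \log \ell)$. The function names are hypothetical and chosen for illustration.

```python
import math


def exponent_from_partial_distances(D):
    """Theorem 1: E(G) = (1/l) * sum_i log_l(D_i), with l = len(D)."""
    l = len(D)
    return sum(math.log(d, l) for d in D) / l


def rs_partial_distances(l):
    """Partial distances of G_RS(q, l) for l <= q: D_i = i + 1."""
    return [i + 1 for i in range(l)]


def rs_exponent(l):
    """Closed form log(l!) / (l log l); lgamma(l + 1) equals log(l!)."""
    return math.lgamma(l + 1) / (l * math.log(l))


for l in (2, 4, 16):
    print(l, round(rs_exponent(l), 6))
```

For $\ell = 4$ this evaluates to about 0.57312, the value of $L(4, 4)$ quoted in the text, and for $\ell = 2$ it gives 1/2, the exponent of the original binary kernel.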

IV. Numerical Simulation Results on the $q$-ary Erasure Channel

A. Recursive calculation of error probability on the q-ary 
erasure channel 

In this section, we evaluate the performance of polar codes
constructed on the basis of the Reed-Solomon matrices. We consider
the $q$-ary erasure channel for simplicity. For $\epsilon \in (0, 1)$,
the $q$-ary erasure channel
$W : \mathbb{F}_q \to \mathbb{F}_q \cup \{*\}$ is defined as

$$W(y \mid x) := \begin{cases} \epsilon, & \text{if } y = * \\ 1 - \epsilon, & \text{if } y = x \\ 0, & \text{otherwise} \end{cases}$$

for any $x \in \mathbb{F}_q$. When a Reed-Solomon matrix is used as a
kernel of polar codes, $P_{\ell^n}^{(i)}$ can be calculated
recursively. Since the Reed-Solomon codes are maximum distance
separable (MDS) codes, erasures in a Reed-Solomon codeword are
correctable if and only if the number of erased symbols is smaller
than the minimum distance. From this observation, one obtains the
recursion formula

$$P_{\ell^n}^{(a\ell+b)} = \sum_{i=b+1}^{\ell} \binom{\ell}{i} \left(P_{\ell^{n-1}}^{(a)}\right)^{i} \left(1 - P_{\ell^{n-1}}^{(a)}\right)^{\ell-i}$$

for $0 \le a \le \ell^{n-1} - 1$ and $0 \le b \le \ell - 1$.
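The recursion can be evaluated directly. The following sketch assumes the natural initial condition that at blocklength $\ell^0 = 1$ the single subchannel is the erasure channel itself, with erasure probability $\epsilon$. The names `rs_erasure_probs` and `sum_of_smallest` are hypothetical; the latter sums the $k$ smallest subchannel erasure probabilities, which corresponds to choosing the $k$ most reliable indices as the information set.

```python
from math import comb


def rs_erasure_probs(eps, l, n):
    """Erasure probabilities of the l^n subchannels for a Reed-Solomon kernel
    of size l on an erasure channel with erasure probability eps."""
    probs = [eps]                      # blocklength l^0 = 1: the raw channel
    for _ in range(n):
        nxt = []
        for p in probs:                # p is P^{(a)} at the previous level
            for b in range(l):         # new index is a*l + b
                # subchannel b is erased iff more than b of the l symbols are
                nxt.append(sum(comb(l, i) * p ** i * (1 - p) ** (l - i)
                               for i in range(b + 1, l + 1)))
        probs = nxt
    return probs


def sum_of_smallest(probs, k):
    """Sum of the k smallest subchannel erasure probabilities."""
    return sum(sorted(probs)[:k])


# Binary kernel, one polarization step: 2e - e^2 and e^2.
print(rs_erasure_probs(0.5, 2, 1))
# 4-ary Reed-Solomon kernel of size 4, three steps, rate 1/4.
p = rs_erasure_probs(0.5, 4, 3)
print(sum_of_smallest(p, len(p) // 4))
```

With $\ell = 2$ the recursion reduces to the familiar binary erasure channel updates $2\epsilon - \epsilon^2$ and $\epsilon^2$. The average of the subchannel erasure probabilities stays equal to $\epsilon$ at every level, which gives a simple sanity check.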

B. Numerical simulation results 

In this subsection, simulation results of $2^m$-ary polar codes are
shown. The blocklength of $2^m$-ary polar codes is $m\ell^n$ viewed
as binary codes, where $\ell$ is the size of a kernel $G$ and a
submatrix of $G^{\otimes n}$ is used as the generator matrix of polar
codes.

Error probabilities of binary, 4-ary, and 16-ary polar codes using
$G_{RS}(q, q)$ on the $q$-ary erasure channels are shown in Fig. 1.
Blocklengths of the binary and 4-ary polar codes are $2^{15}$ viewed
as binary codes. The blocklength of the 16-ary polar code is $2^{14}$
viewed as a binary code. These results imply that the error
probability of polar codes using $G_{RS}(q, q)$ for a large $q$ is
also small at practical blocklengths, although it should be noted
that in Fig. 1 the binary, 4-ary, and 16-ary polar codes are
simulated on different channels, namely, the binary, 4-ary, and
16-ary erasure channels, respectively.

Blockwise independent $q$-ary channels of blocksize $\ell$ can be
viewed as a single $q^\ell$-ary memoryless channel, so that one can
achieve capacity with $q^\ell$-ary polar codes. One can alternatively
use $q$-ary polar codes with a kernel whose size is a multiple of
$\ell$, and still provably achieve capacity. More precisely, a kernel
whose size is a multiple of $\ell$ is required only in the first channel
transform 0, (6). Two binary subchannels constructed from 
the 4-ary erasure channel of erasure probability $\epsilon$ by the
channel transform with $G_{RS}(2, 2)$ are both binary erasure
channels of erasure probability $\epsilon$. Hence, the error
probability of the binary polar codes using $G_{RS}(2, 2)$ on the
4-ary erasure channel is equal to the error probability of the binary
polar codes using $G_{RS}(2, 2)$ of the same rate and half the
blocklength on the binary erasure channel. Hence, Fig. 1 shows that
the 4-ary polar codes have significantly better performance than the
binary polar codes on the 4-ary erasure channel.

Fig. 1. Performance comparison of $q$-ary polar codes using
$G_{RS}(q, q)$ on the $q$-ary erasure channels. Blocklengths of the
binary and 4-ary codes are $2^{15}$ viewed as binary codes. The
blocklength of the 16-ary code is $2^{14}$ viewed as a binary code.
Erasure probabilities of all channels are 0.5.

Fig. 2. Performance comparison of binary polar codes using
$G_{RS}(2, 2)$ and 4-ary polar codes using $G_{RS}(4, 2)$ on the
binary-input AWGN channel. Blocklengths are $2^7$, $2^9$, $2^{11}$,
and $2^{13}$ viewed as binary codes. The results for 4-ary polar
codes and binary polar codes are plotted by solid curves and dotted
curves, respectively.

Fig. 3. Performance comparison of binary polar codes using
$G_{RS}(2, 2)$ and 4-ary polar codes using $G_{RS}(4, 4)$ on the
binary-input AWGN channel. Blocklengths are $2^7$, $2^9$, $2^{11}$,
and $2^{13}$ viewed as binary codes. The results for 4-ary polar
codes and binary polar codes are plotted by solid curves and dotted
curves, respectively.

Simulation results on the binary-input AWGN channel are shown in
Figs. 2 and 3. The standard deviation of the noise is 0.97865. The
capacity of the binary-input AWGN channel is about 0.5. In Figs. 2
and 3, the binary expansion
$(\mathbb{F}_4 = \{0, 1, \alpha, \alpha^2\}) \to (\mathbb{F}_2^2 = \{00, 01, 10, 11\})$
defined as $0 \to 00$, $1 \to 01$, $\alpha \to 10$,
$\alpha^2 \to 11$ is used for assigning 4-ary symbols to inputs of
the binary-input AWGN channels. In order to avoid the high
computational complexity of multi-dimensional density evolution,
$\{P_{\ell^n}^{(i)}\}$ is evaluated by numerical simulation. The sums
$\sum_{j=0}^{i} \tilde{P}_{\ell^n}^{(j)}$, which are upper bounds of
error probabilities, are empirically evaluated and plotted, where
$\{\tilde{P}_{\ell^n}^{(i)}\}$ is the sorted version of
$\{P_{\ell^n}^{(i)}\}$ according to their magnitudes. The
upper bound is considered to be tight if the rate is not close to the
capacity [8]. In Fig. 2, the binary polar codes using $G_{RS}(2, 2)$
and the 4-ary polar codes using $G_{RS}(4, 2)$ are simulated. Instead
of the standard Reed-Solomon matrix
$\begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix} \in \mathbb{F}_4^{2 \times 2}$,
a modified matrix in $\mathbb{F}_4^{2 \times 2}$ is used as
$G_{RS}(4, 2)$, since each bit of the binary image of a 4-ary symbol
is independently polarized by
$\begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix} \in \mathbb{F}_4^{2 \times 2}$.
In Fig. 2, there are small differences in performance between the
binary and 4-ary polar codes. In Fig. 3, the results of the binary
polar codes using $G_{RS}(2, 2)$ and the 4-ary polar codes using
$G_{RS}(4, 4)$ are plotted. It can be confirmed that 4-ary polar
codes using $G_{RS}(4, 4)$ have significantly better performance than
binary polar codes using $G_{RS}(2, 2)$. In Figs. 2 and 3, the
blocklengths are $2^7$, $2^9$, $2^{11}$, and $2^{13}$ viewed as
binary codes.

V. Polar Codes and Algebraic Geometry Codes 
A. Polar codes as algebraic geometry codes 

In [1], Arikan mentioned the relation between binary polar codes and
binary Reed-Muller codes. The relationship can be naturally
generalized to $q$-ary cases. In this subsection, we overview how
$q$-ary Reed-Muller codes are constructed from the $q$-ary
Reed-Solomon matrix $G_{RS}(q, q)$ using the Kronecker power.

Definition 5: Let $q$ be an integer power of a prime. For any
$n \in \mathbb{N}$ and $r = 0, \dots, (q-1)n$, the $q$-ary $r$-th
order Reed-Muller codes are defined as
$\{(p(a_1), \dots, p(a_{q^n})) \mid p \in \mathbb{F}_q[X_1, \dots, X_n], \deg(p) \le r\}$,
where $\{a_1, \dots, a_{q^n}\} = \mathbb{F}_q^n$ and where $\deg(p)$
is the degree of a polynomial $p \in \mathbb{F}_q[X_1, \dots, X_n]$.
It should be noted that the Reed-Muller codes with $n = 1$ are also
called extended Reed-Solomon codes.

The binary $2 \times 2$ matrix can be regarded as the Reed-Solomon
matrix $G_{RS}(2, 2)$, whose rows are the monomials $X$ and $1$
evaluated at the points $X = 1, 0$:

$$\begin{array}{c|cc} X: & 1 & 0 \\ \hline X & 1 & 0 \\ 1 & 1 & 1 \end{array}$$

By using the Kronecker product on $G_{RS}(2, 2)$, a generator matrix
of binary 2-variable Reed-Muller codes is obtained as follows.

$$\begin{array}{c|cccc} (X_2, X_1): & (1,1) & (1,0) & (0,1) & (0,0) \\ \hline X_2 X_1 & 1 & 0 & 0 & 0 \\ X_2 & 1 & 1 & 0 & 0 \\ X_1 & 1 & 0 & 1 & 0 \\ 1 & 1 & 1 & 1 & 1 \end{array}$$

This method of construction of the binary Reed-Muller codes
corresponds to the Plotkin construction. We can see that the binary
expansion of $2^n - 1 - i$ corresponds to the monomial in the $i$-th
row of the Reed-Muller matrix.

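A similar relation also holds for non-binary cases. For example, the
Reed-Solomon matrix $G_{RS}(3, 3)$, whose rows are the monomials
$X^2$, $X$, and $1$ evaluated at the points $X = 2, 1, 0$, is

$$\begin{array}{c|ccc} X: & 2 & 1 & 0 \\ \hline X^2 & 1 & 1 & 0 \\ X & 2 & 1 & 0 \\ 1 & 1 & 1 & 1 \end{array}$$

A generator matrix of ternary 2-variable Reed-Muller codes is
obtained by using the Kronecker product on $G_{RS}(3, 3)$.

$$\begin{array}{c|ccccccccc} (X_2, X_1): & (2,2) & (2,1) & (2,0) & (1,2) & (1,1) & (1,0) & (0,2) & (0,1) & (0,0) \\ \hline
X_2^2 X_1^2 & 1 & 1 & 0 & 1 & 1 & 0 & 0 & 0 & 0 \\
X_2^2 X_1 & 2 & 1 & 0 & 2 & 1 & 0 & 0 & 0 & 0 \\
X_2^2 & 1 & 1 & 1 & 1 & 1 & 1 & 0 & 0 & 0 \\
X_2 X_1^2 & 2 & 2 & 0 & 1 & 1 & 0 & 0 & 0 & 0 \\
X_2 X_1 & 1 & 2 & 0 & 2 & 1 & 0 & 0 & 0 & 0 \\
X_2 & 2 & 2 & 2 & 1 & 1 & 1 & 0 & 0 & 0 \\
X_1^2 & 1 & 1 & 0 & 1 & 1 & 0 & 1 & 1 & 0 \\
X_1 & 2 & 1 & 0 & 2 & 1 & 0 & 2 & 1 & 0 \\
1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 & 1 \end{array}$$

Similarly to the binary case, the ternary expansion of $3^n - 1 - i$
corresponds to the monomial in the $i$-th row of the Reed-Muller
matrix. These observations imply that polar codes using Reed-Solomon
matrices can be naturally regarded as codes spanned by polynomials.
A similar property also holds for the case using matrices related to
Hermitian codes, as discussed in the next subsection.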
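The correspondence between rows of the Kronecker power and monomials can be verified mechanically. The sketch below assumes that $q$ is a prime, so that $\mathbb{F}_q$ is the integers modulo $q$, and that the evaluation points are ordered $q-1, \dots, 1, 0$, as in the column labels above. All function names are hypothetical.

```python
def rs_matrix_prime(q):
    """Rows X^(q-1), ..., X, 1 evaluated at the points q-1, ..., 1, 0."""
    points = range(q - 1, -1, -1)
    return [[pow(x, e, q) for x in points] for e in range(q - 1, -1, -1)]


def kron_power(G, n, q):
    """n-th Kronecker power of G with entries reduced mod q."""
    out = [[1]]
    for _ in range(n):
        out = [[(a * b) % q for a in ra for b in rb] for ra in out for rb in G]
    return out


def digits(i, q, n):
    """q-ary expansion of i with n digits, most significant digit first."""
    out = []
    for _ in range(n):
        out.append(i % q)
        i //= q
    return out[::-1]


def monomial_row(i, q, n):
    """Evaluate the monomial whose exponents are the digits of q^n - 1 - i
    at all points of F_q^n, in the column order of the Kronecker power."""
    exps = digits(q ** n - 1 - i, q, n)
    row = []
    for j in range(q ** n):
        point = [q - 1 - d for d in digits(j, q, n)]   # (X_n, ..., X_1)
        val = 1
        for x, e in zip(point, exps):
            val = val * pow(x, e, q) % q
        row.append(val)
    return row


for q in (2, 3):
    Gn = kron_power(rs_matrix_prime(q), 2, q)
    print(q, Gn == [monomial_row(i, q, 2) for i in range(q ** 2)])
```

Note that `pow(0, 0, q)` evaluates to 1 in Python, which matches the convention that the constant monomial takes the value 1 at every point.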

Note that the selection rule of rows from $G_{RS}(q, q)^{\otimes n}$
for Reed-Muller codes does not maximize the minimum distance unless
$q = 2$. In order to maximize the minimum distance, rows have to be
chosen according to $\prod_{j=1}^{n} (i_j + 1)$, where $i_j$ is the
$j$-th digit of the $q$-ary expansion of the index $i$ of a row. In
this paper, we call codes based on the rule which maximizes the
minimum distance hyperbolic codes, which are also called
Massey-Costello-Justesen codes [10] and hyperbolic cascaded
Reed-Solomon codes [11]. At a fixed positive rate, the minimum
distance of Reed-Muller codes is $q^{\frac{1}{2} n + o(n)}$ [12],
while the minimum distance of polar codes and hyperbolic codes is
$q^{E(G_{RS}(q, q)) n + o(n)}$. Hence, Reed-Muller codes have
asymptotically worse performance than polar codes in the non-binary
case.

B. Polar codes on algebraic geometry codes 

In order to obtain large exponents, algebraic geometry codes 
are considered suitable as kernels of polar codes, since they 
can have large minimum distance and often have the same 
nested structure as Reed-Solomon codes. In order to demon- 
strate feasibility of constructing polar codes using algebraic 
geometry codes, we use Hermitian codes as an example. 

Definition 6: Let $r$ be a power of a prime and $q = r^2$. A function
$\rho : \mathbb{F}_q[X_1, X_2] \to \mathbb{N} \cup \{0, -\infty\}$ is
defined as $\rho(0) := -\infty$,
$\rho(X_1^i X_2^j) := ir + j(r+1)$ and
$\rho(\sum_{i,j} a_{i,j} X_1^i X_2^j) := \max_{a_{i,j} \ne 0} \rho(X_1^i X_2^j)$.
For any $0 \le m \le r^3 + r^2 - r - 1$, the $q$-ary Hermitian codes
are defined as
$\{(p(a_1), \dots, p(a_{r^3})) \mid p \in \mathbb{F}_q[X_1, X_2], \deg_1(p) < q, \deg_2(p) < r, \rho(p) \le m\}$,
where $\{a_1, \dots, a_{r^3}\}$ is the set of zero points of
$X_1^{r+1} - X_2^r - X_2$ in $\mathbb{F}_q^2$ and where $\deg_1(p)$
and $\deg_2(p)$ are the degree of $p$ in $X_1$ and the degree of $p$
in $X_2$, respectively.

TABLE I
Exponents of Reed-Solomon kernels and Hermitian kernels on $\mathbb{F}_{2^m}$.

  m                      2          4          6          8
  E(G_RS(2^m, 2^m))      0.573120   0.691408   0.770821   0.822264
  E(G_H(2^{3m/2}))       0.562161   0.707337   0.802760   0.859299
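For $r = 2$ the objects in Definition 6 are small enough to enumerate. The sketch below is an illustration under stated assumptions: $\mathbb{F}_4 = \{0, 1, \alpha, \alpha^2\}$ is encoded as the integers 0, 1, 2, 3 with $\alpha^2 = \alpha + 1$, addition is bitwise XOR, and multiplication uses an explicit table. The names `MUL`, `gf4_pow`, `hermitian_points`, and `pole_orders` are hypothetical.

```python
# F_4 encoded as 0, 1, 2 (= alpha), 3 (= alpha^2 = alpha + 1).
# Addition is XOR on this encoding; multiplication is tabulated.
MUL = [
    [0, 0, 0, 0],
    [0, 1, 2, 3],
    [0, 2, 3, 1],
    [0, 3, 1, 2],
]
R = 2          # r, so that q = r^2 = 4
Q = R * R


def gf4_pow(x, e):
    """x^e in F_4 by repeated multiplication (x^0 = 1)."""
    out = 1
    for _ in range(e):
        out = MUL[out][x]
    return out


def hermitian_points():
    """Zero points (x1, x2) of X1^(r+1) - X2^r - X2 over F_4."""
    return [(x1, x2) for x1 in range(Q) for x2 in range(Q)
            if gf4_pow(x1, R + 1) == gf4_pow(x2, R) ^ x2]


def pole_orders():
    """rho(X1^i X2^j) = i*r + j*(r+1) for deg_1 < q and deg_2 < r, sorted."""
    return sorted(i * R + j * (R + 1) for i in range(Q) for j in range(R))


print(len(hermitian_points()))   # number of evaluation points
print(pole_orders())             # one monomial per row of the kernel
```

The enumeration yields $r^3 = 8$ points and 8 monomials with distinct values of $\rho$, the largest being $r^3 + r^2 - r - 1 = 9$, which is consistent with an $8 \times 8$ kernel.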

For a prime power $r$, a matrix $G_H(r^3)$ of size $r^3 \times r^3$
can be defined for the $r^2$-ary Hermitian codes, just as the
Reed-Solomon matrix $G_{RS}(q, \ell)$ has been defined for the
$q$-ary Reed-Solomon codes. In this paper, we call the matrix
$G_H(r^3)$ the $r^2$-ary Hermitian matrix. The Hermitian matrix
$G_H(8)$ on $\mathbb{F}_4 = \{0, 1, \alpha, \alpha^2\}$ is the
following.

$(X_2, X_1)$: $(\alpha^2, \alpha^2)$ $(\alpha, \alpha^2)$

From [13], the minimum distances of 4-ary Hermitian codes can be
calculated as $(D_0, \dots, D_{\ell-1}) = (1, 2, 2, 3, 4, 5, 6, 8)$.
Note that each $D_i$ is the largest possible for the given
blocklength and dimension since the MDS conjecture has been proved
for $q = 4$ [14]. However,
$E(G_H(8)) = L(4, 8) \approx 0.562161 < 0.57312 \approx L(4, 4)$.
Values of $E(G_{RS}(2^m, 2^m))$ and $E(G_H(2^{3m/2}))$ are shown in
Table I for $m = 2, 4, 6, 8$. The exponent of the Hermitian matrix
$G_H(2^{3m/2})$ exceeds the exponent of the Reed-Solomon matrix
$G_{RS}(2^m, 2^m)$ for $m \ge 4$. The method of shortening [3] may be
useful for obtaining smaller matrices from Hermitian matrices,
although the exponent may also be smaller.
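As a check on the comparison above, both exponents follow directly from Theorem 1: the Hermitian value from the partial distances $(1, 2, 2, 3, 4, 5, 6, 8)$, and the Reed-Solomon value from $D_i = i + 1$.

```latex
\begin{align*}
E(G_H(8)) &= \frac{1}{8} \sum_{i=0}^{7} \log_8 D_i
           = \frac{\log_8(1 \cdot 2 \cdot 2 \cdot 3 \cdot 4 \cdot 5 \cdot 6 \cdot 8)}{8}
           = \frac{\ln 11520}{8 \ln 8} \approx 0.562161, \\
E(G_{RS}(4,4)) &= \frac{1}{4} \sum_{i=0}^{3} \log_4 (i+1)
           = \frac{\ln 24}{4 \ln 4} \approx 0.573120.
\end{align*}
```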
VI. Conclusion

We have shown error probabilities of $q$-ary polar codes using
Reed-Solomon matrices as kernels by numerical simulations. It is
confirmed by numerical simulations that 4-ary polar codes using the
Reed-Solomon matrix $G_{RS}(4, 4)$ have significantly better
performance than binary polar codes using the Reed-Solomon matrix
$G_{RS}(2, 2)$. We have further shown that kernels with larger
exponents can be obtained by using Hermitian codes. This implies that
algebraic geometry codes might be useful as kernels of polar codes
with large exponents.



